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[57] ABSTRACT 

The performance of a computer system which use reduction 
of clock speed to conserve power is enhanced by dynami- 
cally adjusting the minimum number of clock cycles 
required for memory access ("wait states"). When the com- 
puter system decreases its clock speed, the minimum num- 
ber of wait states is decreased to account for the longer cycle 
time. Likewise, when the computer system increases its 
clock speed, this invention determines whether any increase 
in the minimum number of wait states is required, and if so, 
implements such an increase. 

7 Claims, 3 Drawing Sheets 
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METHOD AND APPARATUS FOR 
PERFORMANCE OPTIMIZATION IN 
POWER-MANAGED COMPUTER SYSTEMS 

FIELD OF INVENTION 

The present invention relates lo reducing power consump- 
tion in computers, and in particular to a method and appa- 
ratus for controlling the number of memory access wait 
states with changes in the system clock speed. 

BACKGROUND OF INVENTION 

Portable and lap-top computers have become widely used 
and are increasing in popularity. An advantage of such 
computers is that they can be operated with the use of 
batteries without the need of an external power supply. One 
limitation faced by these portable computers is that their 
batteries have a limited life. Atypical battery pack generally 
needs to be recharged after every two (2) to four (4) hours 
of use. Hence an efficient power management scheme is 
essential in order to allow these computers to run as long as 
possible without requiring the recharging of their batteries. 
Power management has also become an issue with respect to 
desktop and floor-standing computers because it is increas- 
ingly important to build environmentally friendly computers 
which consume less power. In addition, reduced power 
consumption also lowers heat dissipation and increases the 
reliability of a system. 

A commonly used scheme for managing the power of 
computer systems involves the reduction of the processor 
clock speed during periods of inactivity or on a continuous 
basis when a computer system is operated under battery 
power. With such a scheme, the performance of the system 
invariably suffers because fewer instructions are performed 
per second. 

The clock speed is only one factor upon which the 
performance of a computer system depends. Another impor- 
tant factor affecting the performance of a system is the speed 
at which the microprocessor reads from and writes to system 
memory. Microprocessors generally operate at clock speeds 
that are much higher than the rate at which memory can 
supply or receive data. Therefore, a microprocessor must 
wait whenever an operation requires a read or write to 
memory. The number of clock cycles that the microproces- 
sor must wait is described in terms of "wait states/' and for 
any particular system the number of wait states that will be 
used is programmed into the system's memory controller by 
the system's basic input and output system ("BIOS") firm- 
ware. 

In many computer systems, performance is enhanced 
through the use of a high-speed memory cache, external to 
the microprocessor, that minimizes the performance impact 
of using slower memory, such as DRAM, for the system 
memory. However, in many power-managed computer sys- 
tems such as portables^and lap-tops, an external high-speed 
memory cache is not employed due to power, space, or cost 
limitations. This means that the performance of such a 
system is largely determined by the number of wait states, 
i.e. the number of clock cycles that is required for the 
processor to read from and write to the system memory. 

In current computer systems, the number of wait states 
remains fixed during the operation of the system and is 
determined based on the minimum number of complete 
clock. cycles (or partial clock cycles in some specialized 
systems) that are needed to meet the worst-case access 
timing of the system memory subsystem when the micro- 
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processor is running at its maximum speed. The BIOS 
firmware is designed to program the memory controller to a 
fixed number of wait states. This number is determined by 
the system designer and is based on the maximum clock 

s speed of the microprocessor and the worst-case access time 
scenario for the memory so as to ensure that only the right 
data is read from and written to memory. For instance, the 
length of each clock cycle of a microprocessor which runs 
at 50 MFTZ will be its reciprocal, which would be 20 
0 nanoseconds. Assuming that the memory requires 70 ns for 
an access, the memory controller would have to be pro- 
grammed for three wait states (i.e., the processor would wait 
three cycles and then receive the data on the fourth). This 
results in a total of 80 ns for the memory access since the 

J5 processor must do these operations in complete cycles. 
In existing power-managed computer systems, the num- 
ber of wait states remains the same regardless of the clock 
speed of the processor. For instance if the clock speed of the 
processor is reduced from 50 MHz to 25 MHz (i.e., the cycle 

20 time increases from 20 ns to 40 ns), since the number of wait 
states remain unchanged (i.e., three wait states), memory 
access will now take 160 ns instead of 80 ns even though the 
access time of a memory device does not change with clock 
speed. In other words, the data will be available after 70 ns 

25 and the processor will wait 90 ns after the data is available 
before it will actually receive the data This results in a 
substantial degradation of performance. 

In one proposed solution for conserving power, the use of 
a set of system/memory controller chips allow instantaneous 

30 clock speed changes. Specifically, this method involves the 
lengthening of the processor clock cycle, up to the amount 
necessary to allow for the access of system memory which 
provides data independently of the processor clock. This 
allows the system to be configured to exactly match the time 

35 it takes for the processor to access the system memory to the 
system memory subsystem's worst case access time. 
However, such a system requires a processor that supports 
instantaneous clock speed changes, which is not widely 
available. In addition, the system memory timing is still 

40 treated as a constant when the processor clock speed is 
reduced. 

Another approach for power management is to process a 
task at maximum clock speed, so that the task will be 
completed in the least amount of time, and then to reduce the 

45 clock speed to save power during idle periods. For instance, 
consider the power consumed in one second in a scenario 
where a task would take a computer system operating at 
maximum clock speed x, 100 milliseconds to accomplish. 
Under this first approach, the system would initially operate 

so at its maximum clock speed of x and complete the task in the 
shortest possible time, which in this case is 100 ms. 
Afterwards, assuming that the system does not have to 
immediately perform another task, the system would go into 
a low power state at one-half the clock speed, x/2. At this 

55 slower clock speed, the system remains idle for the follow- 
ing 900 ms which results in lower power consumption. With 
this method, the goal is to provide the user with good 
apparent system speed, while using less total power than if 
the clock speed remained fixed at maximum speed over the 

60 entire second. 

Yet another approach for power management is to reduce 
the clock speed when the computer is in a power conserva- 
tion mode. For instance, a computer system may allow the 
user to set the clock speed to a reduced rate such as one-half 

65 the maximum speed, x/2, in order to save power and thus 
increase the battery life. However, once the system clock 
speed is reduced to one half of its maurur, depending upon 
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the specific task the system is trying to achieve, the pro- FIG. 3 is a block representation of a DRAM control 

cessing time may double. In other words, a task which would register and selected values in certain bit positions of the 

take 100 ms if the processor is running at maximum speed register. 

may take 200 ms after the clock speed has been halved and FIG. 4 is a flow chart illustrating the steps to be performed 

would consume less power. At the end of the 200 ms, the 5 m order to modify the clock speed in accordance with the 

system would then go into an idle state for the rest of the present invention, 
second which would be 800 ms. Generally the apparent 

reduction of performance will depend on the type of appli- DETAILED DESCRIPTION OF PREFERRED 

cation being used. Computation-intensive applications will EMBODIMENTS 

obviously suffer the most performance penalty. The tradeoff 30 ln the following detailed description of the preferred 

is that while less total power may be consumed m this case embodiment, reference is made to the accompanying draw- 

than operating at maximum speed, the user will experience ings which fonn a part hcrcof) and m which ^ shown by way 

serious performance degradation since each individual task of ui^non specific preferred embodiments in which the 

will take looger to complete. invention may be practiced. These embodiments are 

Accordingly, there is a need for a power management 15 described in sufficient detail to enable those skilled in the art 

scheme which achieves power conservation with minimal t o practice the invention, and it is to be understood that other 

performance degradation. embodiments may be utilized and that structural, logical and 

BRIEF SUMMARY OF INVENTION electrical changes may be made without departing from the 

spirit and scope of the present inventions. The following 

The present invention achieves significant performance 20 detailed description is, therefore, not to be taken in a limiting 

advantages by controlling the number of clock cycles used sense, and the scope of the present invention is defined by 

for system memory access in power-managed computer the appended claims. 

systems that employ processor clock speed modification. A Numbering in the Figures is usually done with the hun- 

memory controller is reprogrammed to the optimal number dreds and thousands digits corresponding to the figure 

of wait states for any given processor clock speed, when the 25 number, with the exception that the same components may 

clock speed is changed. In one embodiment, when the clock appear ^ multiple figures. Signals and connections may be 

speed is reduced from 50 to 25 Nm (20 ns to 40 ns clock referred to by the same number or label, and the actual 

cycle time), the memory controller is reprogrammed to meaning should be clear from the context of use. 

instruct the processor to wait one cycle before attempting to FIG. 1 shows a basic personal computer system at 110 

receive data retrieved from memory This results in the 30 ^ & ^ ^ (cpu) m me ffl 

processor receiving the data in two cycles, i.e 80 ns instead m m controller and core logic 116, and a basic 

of 160 ns. In other words, the new number of clock cycles input/output system ^ l0s) R0M 118. A programmable 

used is the lowest number possible based on the new clock tem dock m idcs ^ ^ to a , lcast thc cptJ 

speed and the worst^ase access time of the system memory m and the memory controller t0 synchronize devices 

subsystem fof transfcr of data CPU 112 ^ coupled to the memory 

One benefit of the present invention is that the system controller 116 by a control bus 124, address bus 126 and data 

clock speed may be reduced to save power, yet the system Dus \28. 

performance observed by a user remains similar to that when Mcmo controller 116 in onc embodiment is an appli- 

the processor is running at its maximum clock speed. Even catioQ ific int ted circuit (ASIC) t number 

if some performance degradation occurs, it is usually much 82C 465MV from Opti Corporation. It is coupled to the 

less than in the case where the number of wait states remains DRAM 114 by data bus 128 and a memory address bus 130 

constant during clock speed reduction^ It thus provides the m h which u ^ daU from ^ DRAM U4 amJ 

user with a system with acceptable performance while at the provides fefresh ^ als tQ ensure ^ the data m DRAM is 

same time conserving precious battery power. ^ refreshed and does nQt djssipate ^ e memory coatroller n6 

In one embodiment, prior to setting the clock speed to a ^ a iso coupled to BIOS 118 via Industrial Standard Arcbi- 

higher value, the memory controller first resets the number tecture bus ISA Data bus 140 and ISA address bus 142. 

of wait states. This is done when power conservation is no Memory controller 116 also provides for control of the ISA 

longer desired. It ensures that the processor will wait long DUS v ia an ISA control bus 144. 

enough for the data when increasing the clock speed. 5Q Memory ^^Het U6 conlains several registers for 

In a further preferred embodiment, the clock speed is storing system configuration information. In particular, sev- 

reduced during times of inactivity. The system still performs era ] registers are used to specify whether parity will be used, 

certain background functions when it is in the reduced clock lhe memory size for each bank of memory, refresh rates and 

speed state. By modifying the number of wait states in me number of wait states for the memory controller to 

accordance with the present invention the system completes 5S iniliate prior ^ informing CPU 112 that data satisfying a 

those background functions more quckly and therefore memory request is available from- DRAM 114, or will be 

moves to a completely idle state more quickly, conserving available during the next CPU cycle. This information is 

even more power. passed on to the CPU by means of lowering a NOT READY 

Other objects and advantages of the invention will line in control bus 132. The CPU, in the following cycle will 

become apparent upon reading the following detailed 60 then latch the data from DRAM 114 near the end of the 

description and upon reference to the drawings. cycle. 

BRIEF DESCRIPTION OF THE DRAWINGS „^ hen * c personal com P uler ^Yf cm is switched on, BIOS 

118 provides instructions to CPU 112 causing it to run 

FIG. 1 is a block diagram illustrating a typical personal through a series of system checks or "diagnostics" to test if 

computer system. 65 a number of essential system components exist and are 

FIG. 2 is a chart of processor speeds, DRAM speeds, and functioning properly. The BIOS also provides commands for 

the number of cycles required for reading and writing data. storage of parameters that are necessary in order for com- 
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puter programs to run. For instance, information such as may also be provided, and the control panel invoked by 

input/output ports, type of display monitor, system clock mouse or keyboard commands. 

speed, and number of wait states must be provided before a In a further embodiment, BIOS is shadowed in the DRAM 

personal computer system is operational. These parameters 114. In this embodiment, all access to the specified ROM 

can be modified as the software or hardware needs of a user $ space are redirected to the corresponding DRAM location, 

changes. Thus, the power conservation icon or command may result 

The memory access wait states for given CPU speeds is in commands from DRAM being executed to bring up the 

given in table form in FIG. 2. The first column of informa- control panel In this manner, BIOS from DRAM may be 

tion is the CPU speed, given in megahertz. Hie second executed by the CPU to provide the wait state control of the 

column is for the DRAM speed in providing data, the third 10 present invention. Thus, the invention may be embodied in 

column describes the number of CPU cycles that must pass several different storage medium, from ROM diskette, tape, 

prior to the data being available for use by CPU 114. Note CD and other magnetic and optical storage medium to 

that there are four numbers in the third column. The first transmission into storage via network where it may be 

number corresponds to the number of cycles for a first access temporarily stored in switches, satellites, wires and other 

in a row of DRAM 114 memory. The second, third and 15 transmission medium on its way to storage in a personal 

fourth numbers correspond to the number of cycles to wait computer or workstation, all as represented by memory 

for sequential access to data in the same row, but perhaps in system block 114. 

a different column. The fourth column in the table corre- Processor speeds are increasing much more rapidly than 

sponds to the number of wait states that a CPU should memory access speeds. The invention will also be useful as 

execute following the sending of a write request to memory 2 o applied to level 2 cache memory systems. Such cache 

controller 116. memory systems were originally designed to ensure that no 

In FIG. 3, the proper programming for a DRAM control processor wait states were required However, current and 

register indicated generally at 310 is shown. DRAM control future processor cycle times are increasing so much faster 

register 310 is one of many DRAM setup registers for than memory speeds, the invention will find great use with 

containing setup values for the DRAM 114 as described 25 res P ect to cache memories to increase the performance of 

above. An index value of 35 h is used by BIOS to address computers during power conservation modes which reduce 

and store data in the bit positions shown. Bits 7 and 6 of the cycle times. 

8 bit register contain the number of wait states which are In FT G. 4, a high level flowchart is shown describing the 
used by memory controller 116 to control its NOT READY steps involved in modifying the number of memory wait 
line 132 during a memory read operation. The values bits 7 30 states in response to changes in clock speed. Upon receiving 
and 6 may have is indicated under the bit positions in the a command to modify the power level or other command 
table, and refer back to the table in FIG. 2. Thus, a setting that results in a modification to the speed of the processor 
of "11" in bit positions 7 and 6 correspond to cycle times of clock, the CPU executes the routine shown starting at 410. 
5-4-4-4 with a CPU clock speed of 50 MHz in FIG. 2. This In a further preferred embodiment, a coprocessor or other 
means that data will not be ready for the CPU until the 5th 35 type of processing unit may execute the routine to modify 
cycle during a read operation on a new row of data, or until system parameters. As discussed above, a modifiable table 
the 4th cycle on subsequent consecutive access to the same may be presented to the user with options for either modify 
row. The number of wait states are one less than each of the the number of wait states directly, selecting a power con- 
cycle times, as the cycle times indicate the cycle in which the servation mode, or modifying the processor clock speed 
data will be available. Memory Controller 116 thus raises 40 directly at 412 when the initial command is received. At 414, 
NOT READY 124 for 4 cycle times, which causes the CPU, if the speed of the clock has been requested to be modified 
such as an Intel Corporation based x86 processor to execute by the initial command or subsequent user selection, new 
another T2 cycle or wait cycle /wait state, which essentially wait states corresponding to the new processor clock speed 
is a holding pattern of no real processing. During the 5th are determined by table lookup from the table in FIG. 2. At 
cycle, NOT READY is lowered, and the CPU latches the 45 416, if there is a request to decrease the speed of the clock 
data from DRAM 114 towards the end of the cycle. from the current setting, a register for controlling the clock 
In one embodiment, the control of the number of memory speed is set at 418, and the memory controller register is set 
wait states is done through a system management mode for the new wait states at 420. If there was not a request to 
(SMM) portion of BIOS. On start up of the computer, a user decrease the speed, it is determined at decision block 422 
is given a predetermined amount of time to select a control 50 whether or not the request was for an increase in the speed 
panel for informing the computer of configuration changes. of the clock. If yes, the memory controller register is set for 
This allows a user to provide information on new memory the new wait states as 422 and then the clock is set to the new 
or peripheral devices. The present invention provides some speed at 426. If there was no speed increase requested at 
additional parameters for the user to select. The user can 422, then there was no request to change the speed of the 
either select a different processor speed, or, for normal users 55 clock and control is returned at return 436. 
who do not need to understand, they can select a power In addition to the user of a . system directly selecting power 
conservation mode. Either of these selections will cause conservation which results in lower clock speed, the reduc- 
both the clock 120 to be reprogrammed to a different cycle tion in clock speed may also result from other methods of 
speed from that found in a CPU nonvolatile memory or from control. In one embodiment, the clock speed is reduced after 
the clock chip, and cause the number of wait states to be 60 a certain predetermined period of time of inactivity. If no 
modified in register 310 of memory controller 116. In a user commands are issued, or no applications are running, at 
further embodiment, the user may directly modify the num- a selectable time, perhaps 2 minutes, a command to conserve 
ber of wait states by changing them in a user modifiable power is issued to the wait state routine of FIG. 4. At the first 
table. A system management interrupt will then ensure that instance of a user command or an application being initiated, 
register 310 is reprogrammed correctly. Values outside the 65 a command is provided back to the wait state routine to 
range of acceptable values will be rejected. In addition to increase the clock speed, resulting in the wait states being 
access to the control panel at start up, an icon or command modified back. 
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While the invention was described in connection with 
various embodiments, it was not the intent to limit the 
invention to that emnbodiment. On the contrary, it is the 
intent to cover all alternatives, modifications and equivalents 
as may be included within the spirit and scope of the 5 
invention as defined by the appended claims. Many other 
embodiments will be apparent to those of skill in the art 
upon reviewing the above description. Accordingly, it is 
apparent that modifications can be made to the apparatus and 
method for using same without departing from the teaching 10 
of the present invention. 

What is claimed is: 

1. A method of improving performance in a power- 
managed computer system having a minimum number of 
memory access wait states, the method comprising the steps is 
of: 

(a) detecting a request to change a clock speed of said 
system; 

(b) determining if implementation of said request requires 
an increase in said clock speed; and 

(c) adjusting the minimum number of memory access wait 
states based on said request and prior to implementing 
said request. 

2. A method of improving performance in a power- 
managed computer system having a minimum number of 
memory access wait states, the method comprising the steps 
of: 

(a) detecting a request to change a clock speed of said 
system; 

(b) determining if implementation of said request requires 
a decrease in said clock speed; and 

(c) implementing said request; and 

(d) adjusting the minimum number of memory access 
wait states based on said request. 

3. A computer-readable memory containing a computer 
program for causing a computer to perform the following 
steps: 

(a) detect a request to change a clock speed of said system 

(b) determine if implementation of said request requires 
an increase in said clock speed; and 

(c) adjust the minimum number of memory access wait 
states based on said request and prior to implement- 
ing said request. 



20 



25 



40 



4. A computer-readable memory containing a computer 
program for causing a computer to perform the following 
steps: 

(a) detect a request to change a clock speed of said system 

(b) determine if implementation of said request requires a 
decrease in said clock speed; and 

(c) implement said request; and 

(d) adjust the minimum number of memory access wait 
states based on said request. 

5. A computer system comprising: 
a central processing unit; 

a memory for storing programs and data; 
a clock having at least two clock speeds; and 
a memory controller coupled to the central processing 
unit, the memory, and the clock for accessing the 
memory in response to a request from the central 
processing unit to change the clock speed, for deter- 
mining if the request requires an increase in said clock 
speed, and for adjusting the minimum number of wait 
states prior to implementing the request, before inform- 
ing the central processing unit that the data is available. 

6. The computer system of claim 5, wherein the memory 
controller further comprises a register which is loadable with 
a number representing the clock speed dependent number of 
wait states by the central processing unit. 

7. A computer system comprising: 
a central processing unit; 

a memory for storing programs and data; 
a clock having at least two clock speeds; and 
a memory controller coupled to the central processing 
unit, the memory, and the clock for accessing the 
memory in response to a request from the central 
processing unit to change the clock speed, for deter- 
mining if the request requires a decrease in said clock 
speed, and for implementing the request prior to adjust- 
ing the minimum number of wait states, before inform- 
ing the central processing unit that the data is available. 
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